
Appendix

Neural Information Processing Systems

We limit the target languages for this augmentation process to Arabic, Finnish, Japanese, Korean, Russian, Spanish, Swedish, Hebrew, Thai, Danish, French, Italian, Dutch, Polish, and Portuguese. Interestingly, just adding this language code effectively changes the outputs, as shown in Table 7. We further subsample 50% of the synthetically generated questions. During inference, we first retrieve the top 15 passages using mDPR, and then feed the questions and concatenated passages into the mGEN model with language tags. The gray dots concentrated in the lower right part of the first figure represent encoded Thai embeddings.
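The retrieve-then-generate inference step described above can be sketched as follows. This is a minimal illustration with toy stand-ins: `retrieve_top_k` and `build_generator_input` are hypothetical helpers, not the actual mDPR retriever or mGEN generator, and the word-overlap scoring is only a placeholder for dense retrieval.

```python
# Sketch of the pipeline: retrieve top-k passages, then build the
# generator input as "<language tag> + question + concatenated passages".
# The retriever and generator here are toy stand-ins, not mDPR/mGEN.

def retrieve_top_k(question, corpus, k=15):
    """Toy retriever: rank passages by word overlap with the question."""
    q_words = set(question.lower().replace(".", "").split())
    scored = sorted(
        corpus,
        key=lambda p: -len(q_words & set(p.lower().replace(".", "").split())),
    )
    return scored[:k]

def build_generator_input(question, passages, lang_tag):
    """Prepend a language tag so the generator answers in that language."""
    return f"<{lang_tag}> {question} " + " ".join(passages)

corpus = ["Paris is the capital of France.", "Tokyo is the capital of Japan."]
passages = retrieve_top_k("capital of France", corpus, k=15)
model_input = build_generator_input("capital of France", passages, "fr")
```

In the real system the generator input would be tokenized and fed to the multilingual generation model; the language tag alone is what steers the output language, as the entry notes.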



From the MCMC perspective, we could treat these (already learned) models as proposals for the approximate MH-algorithm

Neural Information Processing Systems

We thank all of the reviewers for their valuable feedback and detailed comments. That is, "improvement and justification of any implicit sampler." We know that in practice even state-of-the-art generative models yield "unrealistic" samples, and hence are biased (Algorithm 3). Based on our theoretical analysis, we derive different losses for the discriminator (Table 1 in the paper).
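The idea of using an already-learned generative model as the proposal in an approximate Metropolis-Hastings scheme can be sketched with a standard trick: a discriminator score D(x) ≈ P(real | x) yields the density-ratio estimate p_data(x)/p_model(x) ≈ D(x)/(1 − D(x)), which is all an independence-MH acceptance test needs. This is a generic illustration under that assumption, not the paper's specific algorithm; the function names are hypothetical.

```python
import random

def density_ratio(d_score):
    """D(x) ~ P(real|x) gives the ratio p_data(x)/p_model(x) ~ D/(1-D).
    Scores are clamped away from 0 and 1 to avoid division by zero."""
    d = min(max(d_score, 1e-6), 1.0 - 1e-6)
    return d / (1.0 - d)

def mh_resample(samples, d_scores, n_steps=1000, seed=0):
    """Independence-MH chain over pre-drawn model samples: propose a
    random sample, accept with min(1, r(proposed)/r(current)), where
    r is the discriminator-based density ratio."""
    rng = random.Random(seed)
    idx = 0
    kept = []
    for _ in range(n_steps):
        j = rng.randrange(len(samples))
        accept = min(1.0, density_ratio(d_scores[j]) / density_ratio(d_scores[idx]))
        if rng.random() < accept:
            idx = j
        kept.append(samples[idx])
    return kept

# Two model samples: one the discriminator rates as realistic (0.9),
# one it rates as unrealistic (0.1). The chain should favor the former.
chain = mh_resample(samples=[0, 1], d_scores=[0.9, 0.1], n_steps=1000)
```

The chain spends most of its time on the high-ratio sample, which is how MH-style resampling can debias samples from an imperfect ("unrealistic") generator.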


A Transformer-Based Approach for Diagnosing Fault Cases in Optical Fiber Amplifiers

Schneider, Dominic, Rapp, Lutz, Ament, Christoph

arXiv.org Artificial Intelligence

A transformer-based deep learning approach is presented that enables the diagnosis of fault cases in optical fiber amplifiers using condition-based monitoring time series data. The model, the Inverse Triple-Aspect Self-Attention Transformer (ITST), uses an encoder-decoder architecture with three feature extraction paths in the encoder, feature-engineered data for the decoder, and a self-attention mechanism. The results show that ITST outperforms state-of-the-art models in terms of classification accuracy, enabling predictive maintenance for optical fiber amplifiers and reducing network downtimes and maintenance costs. In present-day optical transmission links, optical fiber amplifiers are key components of long-haul and metro fiber optical networks. Aging of these devices can result in slow but permanently increasing performance degradation, or even a complete outage of the affected link, leading to cost-intensive maintenance and significant loss of income.



IAUNet: Instance-Aware U-Net

Prytula, Yaroslav, Tsiporenko, Illia, Zeynalli, Ali, Fishman, Dmytro

arXiv.org Artificial Intelligence

Instance segmentation is critical in biomedical imaging to accurately distinguish individual objects like cells, which often overlap and vary in size. Recent query-based methods, where object queries guide segmentation, have shown strong performance. While U-Net has been a go-to architecture in medical image segmentation, its potential in query-based approaches remains largely unexplored. In this work, we present IAUNet, a novel query-based U-Net architecture. The core design features a full U-Net architecture, enhanced by a novel lightweight convolutional Pixel decoder, making the model more efficient and reducing the number of parameters. Additionally, we propose a Transformer decoder that refines object-specific features across multiple scales. Finally, we introduce the 2025 Revvity Full Cell Segmentation Dataset, a unique resource with detailed annotations of overlapping cell cytoplasm in brightfield images, setting a new benchmark for biomedical instance segmentation. Experiments on multiple public datasets and our own show that IAUNet outperforms most state-of-the-art fully convolutional, transformer-based, and query-based models, as well as models designed specifically for cell segmentation, setting a strong baseline for cell instance segmentation tasks. Code is available at https://github.com/SlavkoPrytula/IAUNet


Apple Is Pushing AI Into More of Its Products--but Still Lacks a State-of-the-Art Model

WIRED

Apple continued its slow-and-steady approach to integrating artificial intelligence into devices like the iPhone, Mac, and Apple Watch on Monday, announcing a raft of new features and upgrades at WWDC. The company also premiered the Foundation Models framework, a way for developers to write code that taps into Apple's AI models. Among the buzzier AI announcements at the event was Live Translation, a feature that translates phone and FaceTime calls from one language to another in real time. Apple also showed off Workout Buddy, an AI-powered voice helper designed to provide words of encouragement and useful updates during exercise. "This is your second run this week," Workout Buddy told a jogging woman in a demo video.


d6288499d0083cc34e60a077b7c4b3e1-AuthorFeedback.pdf

Neural Information Processing Systems

We thank all the reviewers for their efforts and constructive comments, which help improve the quality of our paper. Based on the analysis of the first and second moments of the estimator presented in Theorems 5.1 and 5.2, a Chebyshev-type error bound can easily be obtained; we will present it as a corollary in the final version. We will add this discussion to the final version. Regarding a better figure representing MSE vs. p: thanks for the suggestion, and we will revise our paper accordingly. The figure shows that the RMSE decreases as p increases.
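The Chebyshev-type bound referred to above follows from the standard argument: the first and second moments determine the variance, which plugs directly into Chebyshev's inequality. The notation below is generic and illustrative, not necessarily the paper's own:

```latex
% Chebyshev-type error bound from the first two moments of an
% estimator \hat{\theta} (generic symbols, not the paper's notation):
\mathrm{Var}(\hat{\theta})
  = \mathbb{E}\big[\hat{\theta}^{2}\big] - \big(\mathbb{E}[\hat{\theta}]\big)^{2},
\qquad
\Pr\Big( \big|\hat{\theta} - \mathbb{E}[\hat{\theta}]\big| \ge \varepsilon \Big)
  \le \frac{\mathrm{Var}(\hat{\theta})}{\varepsilon^{2}}.
```

With the moments from Theorems 5.1 and 5.2 substituted in, this is exactly the kind of corollary the authors say they will state.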


Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models

Sun, Haoxiang, Min, Yingqian, Chen, Zhipeng, Zhao, Wayne Xin, Liu, Zheng, Wang, Zhongyuan, Fang, Lei, Wen, Ji-Rong

arXiv.org Artificial Intelligence

In recent years, the rapid development of large reasoning models has resulted in the saturation of existing benchmarks for evaluating mathematical reasoning, highlighting the urgent need for more challenging and rigorous evaluation frameworks. To address this gap, we introduce OlymMATH, a novel Olympiad-level mathematical benchmark, designed to rigorously test the complex reasoning capabilities of LLMs. OlymMATH features 200 meticulously curated problems, each manually verified and available in parallel English and Chinese versions. The problems are systematically organized into two distinct difficulty tiers: (1) AIME-level problems (easy) that establish a baseline for mathematical reasoning assessment, and (2) significantly more challenging problems (hard) designed to push the boundaries of current state-of-the-art models. In our benchmark, these problems span four core mathematical fields, each including a verifiable numerical solution to enable objective, rule-based evaluation. Empirical results underscore the significant challenge presented by OlymMATH, with state-of-the-art models including DeepSeek-R1 and OpenAI's o3-mini demonstrating notably limited accuracy on the hard subset. Furthermore, the benchmark facilitates comprehensive bilingual assessment of mathematical reasoning abilities, a critical dimension that remains largely unaddressed in mainstream mathematical reasoning benchmarks. We release the OlymMATH benchmark at the STILL project: https://github.com/RUCAIBox/Slow_Thinking_with_LLMs.
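The "verifiable numerical solution" design enables rule-based grading: extract a number from the model's output and compare it to the reference within a tolerance. A minimal sketch of such a checker is below; it is a generic illustration, not the benchmark's actual evaluation script, and `check_answer` is a hypothetical name.

```python
import math
import re

def check_answer(model_output, reference, rel_tol=1e-6):
    """Rule-based numeric grading: take the last number appearing in the
    model's output and compare it to the reference answer within a
    relative tolerance. Returns False if no number is found."""
    matches = re.findall(r"-?\d+(?:\.\d+)?(?:[eE][+-]?\d+)?", model_output)
    if not matches:
        return False
    try:
        value = float(matches[-1])
    except ValueError:
        return False
    return math.isclose(value, float(reference), rel_tol=rel_tol)
```

Taking the last number is a common heuristic for final-answer extraction; a real grader would typically also normalize formats such as boxed answers or fractions.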